CDS

Accession Number TCMCG061C54924
gbkey CDS
Protein Id XP_042021011.1
Location join(1692095..1692403,1693865..1694091,1695033..1695100,1695497..1695568,1695743..1696074,1696298..1696366,1696439..1696524,1696620..1696777,1696844..1698596,1698719..1698796,1698877..1698988,1699207..1699257,1699342..1699392)
Gene LOC121768540
GeneID 121768540
Organism Salvia splendens

Protein

Length 1121aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA737421
db_source XM_042165077.1
Definition dentin sialophosphoprotein-like isoform X2 [Salvia splendens]

EGGNOG-MAPPER Annotation

COG_category S
Description Occludin homology domain
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko04121        [VIEW IN KEGG]
KEGG_ko ko:K11807        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGTATGGAGCCTCCGGAAAGATGGGCCGCGGCGGCAAAAGGAGCATTCACGCGCCGCCAACCGGCCGACCTACCCCCGCCAGCCGCCTCTCCATGGGCGGGGGCCCACGCGGTCTCGGCCGGCCGCCGGCATCATCACCGTCGGCTTCCTCGCTGCAGGTGGAGGAGAGCTTCAGCCTTGTTAGGGAGAACCCTTTGAATTTCGGGATGGCAATTAAGCTGACTGCTGACCTTGTGGAGGAGATCAAGCGAGTGGAGGCGCAGGGTGGTGCGGCGCGCATCAAATTCGGCGCTAATGCTAGTGGGAATGTTATTCAAGTAGGGGACAAAACCATCAAATTCACTTGGTCCAGGGAACCAGGGGACTTATGTGATGTATATGAAGAACGTCAGAGTGGTGATGATGGAAACGGCTTGCTTGTAGAGTCAGGAGGCACTTGGCGTAAGGTTAATGTAGAGCGAGAATTAGATGAATCAACTAAAAATCATGTCAAAAGGCTGTCAGAAGAAGCCGAGCGCAAAATGAAATCACGAAAATCAATTGTTTTAGATCATCAGAATCCAGCTATGAAGAATCAAATGAAGGCATTTGCTGCTTCTGAGTCTACTCAATGGAGAAGTTTCAAGAATCGGAAAGAACCTCCTTTCAAGAAGCCAAAATCTGCACCAACTTCAGGTGGACCTCCAAAGTCTGTTTTTAAGCCTGGTTTACCTAAGGGGAGGCTCTCTTCTCGCTCACCTTTGTCTTCCCAACCAGAGCAGCAACTTGGTCCTTCAACATCTCCAATCGGAATTGGTGATTTTGGGAAGGGACAAACTGTTGTCTCGGATTTTGCGGCAACTCAAAATCTGAATAAAGTTTTGAGCTCTGAAAAAGATACAGCCAACAGAAGGAATAGTACCATCAGCGATAGATCTAAATTGAATGTTGAGGCCAAACCTACTGATTTAAAGAGTCGGATTACTTCTCTACTCCTGGAGCATCAGTCTACAGGAATGAGCCTTAAGGCCTTGGAGAGAGCTATTGGAGATCCAATGCCCAACTCGGCTCGGAGAATTGAGCCCATATTGAAACAAATTGCAATTTATCAAGCTCCTGGGAGATATTTCCTAAACACAGGAGTGGAGATGGAAAGCTCCAAGAAGCTACCTCAAAGCAGAAGTTCTCCTGAAGTTAATCGTGACCAATCACCTGCACCTCAGAAGTTTGACCAACATCCTAGGGAAGATCCTAGCATTCACATTAGCACAGATACCAATAATGAACTGGGTGAATTGAATTCAACACCTACTCATCTAGCAAACATCATAGAGAAAATAGGCAGCTCCAGTGATAGTGGAAGTGACAGTGACAGTGATAGTAGTGACAGTGGTAGTGATAGTGGAAGTCCTAGCAAAAGTAAAAGCCGAAGCCCTGTGGGTAGTAGTAATGACAGTGATAGCGATGCATCTTCGAGCAGCAAGCAGGCATCTGATGAGGATTTAGATATTATGAATAGTGATGATGAGAATGAATCCAAGCATAAATTGCAAGATCCTGATAACGACCCTATTGATATAGGGAATTCAGATATTCAAGATCCCTTTGGTGAGATTGAGATTGACATTGAGAAAGACTCCCCTGAATGTGACCATGGTCTGCAAATGCCCCAGGCTAATAATGTACTTGCCGGTAAAGCAGACGAAGAAAGACGAGTCTCTGAGAGAAAAGGATATGAAGAATCAGGTAGCATGGTCAATGATGGCTTCAAACATGGTCAGTCGGATACTCAGGGAAGATCATCAAAGGGAAATTCTAAAAGACGCTCTGATGATAATAATTTGGAAGACAGGGGACACTGCAAAAAGAAGTCAAAGAGTAAAAACTCAACCGAACCAGTTTCGGGGACTATAAACTCTCTTTTTGGCGAGAGCCCTTACAACTCATTATCGCCTAATAGGCCTCTGCAAGGCATTGACACTCAGCCTGTTGACCTAATGGAAGACAGAACCTACAAAAACGATAGACATTATTCCGATCAACAAACAGGTCCTAACCATCAACCAGTATCACATAAACCTGTTTCCGATTCTCAGCCAGTGCAAAGATCTTCTGAAGGTCGTGGATGGACTGAGGCTCCTTCTGGAGAAAAACGAGCTGGTAAACGACCTGGTTTGCTTAATGAGCAACAGCAGTTACAGTTTTCCACCGGAGTTGGAGACAAACATGCACCAATCATAGAACCCAATATCAGAAAATCTGAAATGACAGGTAAGGTGAAGGAAACAGGCCCTAGTTTCAACTCACACAGGGGGTATTCTCCCAAGATTGACATTCCGAGTACTGCCGATAGATCTCCGATGATGAATGGACGGAATGGTGTACTTCGAAGAGAGCAGTCAGACTTAGAGTTGGGTGAATTTCGAGAACCCTCCAATGAGGAAGCTCCAGCATCTAAGAAGCAATTTGAGAGGAAAAATTCGTTTAAACATTTGGAAAACAAACGAAAGGATGCTGAATTCTGGAACTCAGATTTCAGTGGAGGAAAGACTTCTAATAAGATCCCAGCGGATCCGGGAAAAATGTCTTCACCAAATCCAGATGCTGTTATTTCCAGCAATCCAGATGGACCTTATAGGAGGAAGGCCCAAGAAAATTATGTTGATGACCGAACAAGACCTCATCATAGAAGCACCCAACCTCTTGATGCACATCATCAGCCCCAAGTGGATATCAACTCCCAGCAAAATACAGCACCAGAAGTGAGAGGTAAAACTAGATTCGCGGAAGCTGGGATGAGCAGGGGTGCTAGCATTGAAGCCTATGGAGATGCTTCCCGGAAAATTTCAGGTAATTCAATAGAACAGCAGCATGATCCAATACAAGGAGTCGAACCCCGTGCCACTAAGCAAAGCAAGAAACAGAAGTCCAATAGGCCAAGTGTCTCGAATGATAGACGAACAGATGCTTTGACTGGGAGCAATGATAGCCAGCAACAAAAGAAATTGTCTTCTTCAGATGATACCAGTTGTTCATTCACTAAATTTGAGAAGGAGCAGCCTGAGCTGAAGGGTCCTATAAGGGATATTTCTCAGTACAAAGAATATGTGAAGGAATATCAAGAAAAGTACGAGAGCTATTGCTCCTTGAACAAAATATTGGAGTCGTACAGGGATGAGTTTAGTAAACTTGGTAAGGATCTTGAGGCTTACAAAGGTAGGGATGTGAAAAAATATAACGACATCTTGGAGCAGATGAGGTCCTCATTTCGTCAATGTGGGGAGAAACATAAACGCTTGAAGAAAATATTCGTTGTGCTTTATGAAGAGTTGAAGCATTTGAAACAGATGATCAAAGACTATGCCACCTCATATAAAAAGGACTGA
Protein:  
MYGASGKMGRGGKRSIHAPPTGRPTPASRLSMGGGPRGLGRPPASSPSASSLQVEESFSLVRENPLNFGMAIKLTADLVEEIKRVEAQGGAARIKFGANASGNVIQVGDKTIKFTWSREPGDLCDVYEERQSGDDGNGLLVESGGTWRKVNVERELDESTKNHVKRLSEEAERKMKSRKSIVLDHQNPAMKNQMKAFAASESTQWRSFKNRKEPPFKKPKSAPTSGGPPKSVFKPGLPKGRLSSRSPLSSQPEQQLGPSTSPIGIGDFGKGQTVVSDFAATQNLNKVLSSEKDTANRRNSTISDRSKLNVEAKPTDLKSRITSLLLEHQSTGMSLKALERAIGDPMPNSARRIEPILKQIAIYQAPGRYFLNTGVEMESSKKLPQSRSSPEVNRDQSPAPQKFDQHPREDPSIHISTDTNNELGELNSTPTHLANIIEKIGSSSDSGSDSDSDSSDSGSDSGSPSKSKSRSPVGSSNDSDSDASSSSKQASDEDLDIMNSDDENESKHKLQDPDNDPIDIGNSDIQDPFGEIEIDIEKDSPECDHGLQMPQANNVLAGKADEERRVSERKGYEESGSMVNDGFKHGQSDTQGRSSKGNSKRRSDDNNLEDRGHCKKKSKSKNSTEPVSGTINSLFGESPYNSLSPNRPLQGIDTQPVDLMEDRTYKNDRHYSDQQTGPNHQPVSHKPVSDSQPVQRSSEGRGWTEAPSGEKRAGKRPGLLNEQQQLQFSTGVGDKHAPIIEPNIRKSEMTGKVKETGPSFNSHRGYSPKIDIPSTADRSPMMNGRNGVLRREQSDLELGEFREPSNEEAPASKKQFERKNSFKHLENKRKDAEFWNSDFSGGKTSNKIPADPGKMSSPNPDAVISSNPDGPYRRKAQENYVDDRTRPHHRSTQPLDAHHQPQVDINSQQNTAPEVRGKTRFAEAGMSRGASIEAYGDASRKISGNSIEQQHDPIQGVEPRATKQSKKQKSNRPSVSNDRRTDALTGSNDSQQQKKLSSSDDTSCSFTKFEKEQPELKGPIRDISQYKEYVKEYQEKYESYCSLNKILESYRDEFSKLGKDLEAYKGRDVKKYNDILEQMRSSFRQCGEKHKRLKKIFVVLYEELKHLKQMIKDYATSYKKD